Journal article
Unsupervised discovery of microbial population structure within metagenomes using nucleotide base composition
I Saeed, SL Tang, SK Halgamuge
Nucleic Acids Research | OXFORD UNIV PRESS | Published : 2012
DOI: 10.1093/nar/gkr1204
Abstract
An approach to infer the unknown microbial population structure within a metagenome is to cluster nucleotide sequences based on common patterns in base composition, otherwise referred to as binning. When functional roles are assigned to the identified populations, a deeper understanding of microbial communities can be attained, more so than gene-centric approaches that explore overall functionality. In this study, we propose an unsupervised, model-based binning method with two clustering tiers, which uses a novel transformation of the oligonucleotide frequency-derived error gradient and GC content to generate coarse groups at the first tier of clustering; and tetranucleotide frequency to ref..
View full abstractGrants
Awarded by Australian Research Council
Funding Acknowledgements
Australian Research Council (grant number DP1096296); mud volcano metagenomics work was supported by the National Science Council of Taiwan (grant number NSC99-2627-M-002-010) Funding for open access charge: University of Melbourne.